[cudax] Implement cudax::coop::reduce for warp groups within a block#9258
[cudax] Implement cudax::coop::reduce for warp groups within a block#9258davebayer wants to merge 1 commit into
cudax::coop::reduce for warp groups within a block#9258Conversation
|
Note Reviews pausedIt looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the Use the following commands to manage reviews:
Use the checkboxes below for quick actions:
SummaryThis PR implements Core Implementation Changes
Supporting Changes
Tests
API / ABI Surface Changes
Review Notes / Areas to Verify
important: WalkthroughAdds a warp-scoped two-stage cooperative reduce overload, conditions group mapping/queries on unit hierarchy and adjusts barrier init for per-unit threads, and adds parametrized CUDA tests validating multi-warp reductions across types, ops, and item counts. ChangesCooperative reduce for warps within block
Assessment against linked issues
Possibly related PRs
Suggested labels
Suggested reviewers
Comment |
There was a problem hiding this comment.
Actionable comments posted: 1
🧹 Nitpick comments (1)
cudax/test/coop/reduce/warps_within_block.cu (1)
152-153: ⚡ Quick winsuggestion: The test names and tags still say
this_warpon Lines 152 and 173, but this file/target iswarps_within_block. Rename theC2H_TESTnames/tags towarps_within_blockto keep filtering and failure triage unambiguous.Also applies to: 173-174
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: b9ab43e4-f31a-4ae2-94bb-fdad331fffa3
📒 Files selected for processing (5)
cudax/include/cuda/experimental/__coop/reduce.cuhcudax/include/cuda/experimental/__group/queries.cuhcudax/include/cuda/experimental/__group/synchronizer/barrier_synchronizer.cuhcudax/test/CMakeLists.txtcudax/test/coop/reduce/warps_within_block.cu
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
3aebfa2 to
05ebea1
Compare
This comment has been minimized.
This comment has been minimized.
090635a to
a7140fb
Compare
There was a problem hiding this comment.
Actionable comments posted: 1
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: 1a491f9e-17cf-4aba-a115-6d696e80c0f0
📒 Files selected for processing (6)
cudax/include/cuda/experimental/__coop/reduce.cuhcudax/include/cuda/experimental/__group/group.cuhcudax/include/cuda/experimental/__group/queries.cuhcudax/include/cuda/experimental/__group/synchronizer/barrier_synchronizer.cuhcudax/test/CMakeLists.txtcudax/test/coop/reduce/warps_within_block.cu
🚧 Files skipped from review as they are similar to previous changes (5)
- cudax/include/cuda/experimental/__group/synchronizer/barrier_synchronizer.cuh
- cudax/test/CMakeLists.txt
- cudax/include/cuda/experimental/__group/queries.cuh
- cudax/test/coop/reduce/warps_within_block.cu
- cudax/include/cuda/experimental/__coop/reduce.cuh
🥳 CI Workflow Results🟩 Finished in 35m 02s: Pass: 100%/55 | Total: 8h 09m | Max: 35m 00s | Hits: 71%/46192See results here. |
Fixes #9160.